Multi-armed bandit

Results: 113



# Item
31 Statistics / Machine learning / Multi-armed bandit / Stochastic optimization / Bandit / Variance

Multi-Bandit Best Arm Identification V. Gabillon, M. Ghavamzadeh, A. Lazaric & S. Bubeck Sequel Group Meeting, 21 October 2011. An Example

Source URL: victorgabillon.nfshost.com

Language: English - Date: 2011-10-25 11:18:31
32 Mathematical optimization / Operations research / Mathematical analysis / Numerical analysis / Convex optimization / Stochastic optimization / Multi-armed bandit / Game theory / Linear programming / AMPL

CSStat 260, Fall 2014: Learning in Sequential Decision Problems Lectures: Evans 334. Tuesday/Thursday 2:00-3:30. Instructor: Peter Bartlett http://www.stat.berkeley.edu/~bartlett

Source URL: www.stat.berkeley.edu

Language: English - Date: 2014-08-28 11:49:09
33 Markov models / Markov processes / Stochastic optimization / Mathematical optimization / Operations research / Reinforcement learning / Markov decision process / Algorithm / Multi-armed bandit / Dynamic programming / Shortest path problem / PP

Deterministic MDPs with Adversarial Rewards and Bandit Feedback Raman Arora TTIC 6045 S. Kenwood Ave. Chicago, IL 60637, USA

Source URL: dept.stat.lsa.umich.edu

Language: English - Date: 2012-09-12 18:50:24
34 Mathematics / Mathematical analysis / Artificial intelligence / Backgammon / Rollout / Markov decision process / Multi-armed bandit / Reinforcement learning / Inverted pendulum / Pendulum / Prime-counting function / Valuation

Rollout Allocation Strategies for Classification-based Policy Iteration Victor Gabillon Alessandro Lazaric

Source URL: victorgabillon.nfshost.com

Language: English - Date: 2010-07-01 09:47:14
35 Statistics / Probability distributions / Statistical inference / Estimation theory / Machine learning / Stochastic optimization / Bayesian inference / Sampling / Normal distribution / Beta distribution / Multi-armed bandit / Confidence interval

An Empirical Evaluation of Thompson Sampling Lihong Li Yahoo! Research Santa Clara, CA

Source URL: papers.nips.cc

Language: English - Date: 2014-02-24 03:34:34
36 Matching / Combinatorics / Game theory / Fellows of the Econometric Society / Cooperative games / Stable marriage problem / CC / Reinforcement learning / Multi-armed bandit / Alvin E. Roth / Greedy algorithm / Algorithm

Two-Sided Bandits and the Dating Market Sanmay Das Center for Biological and Computational Learning and Computer Science and Artificial Intelligence Lab Massachusetts Institute of Technology Cambridge, MA 02139

Source URL: faculty.chicagobooth.edu

Language: English - Date: 2006-08-08 10:56:19
37

Multi-armed Bandit Problems with History Pannagadatta Shivaswamy and Thorsten Joachims Department of Computer Science, Cornell University, Ithaca NY {pannaga,tj}@cs.cornell.edu

Source URL: snowbird.djvuzone.org

Language: English - Date: 2011-02-10 15:51:00
38

Multi-armed bandit experiments in the online service economy Steven L. Scott December 20, 2014 Abstract The modern service economy is substantively different from the agricultural and manufacturing economies that precede

Source URL: faculty.chicagobooth.edu

Language: English - Date: 2015-01-20 12:35:42
39

Adaptive Algorithms for Fixed-Cost Multi-Armed Bandit Problems with Budget Constraints Sandip Sen Anton Ridgway

Source URL: swarmlab.unimaas.nl

Language: English - Date: 2014-03-15 02:54:22
40

Multi-Armed Bandit Models for 2D Grasp Planning with Uncertainty Michael Laskey, Jeff Mahler, Zoe McCarthy, Florian T. Pokorny, Sachin Patil, Jur van den Berg, Danica Kragic, Pieter Abbeel, Ken Goldberg

Source URL: www.ieor.berkeley.edu

Language: English - Date: 2015-08-31 02:12:22